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A CONNECTION BETWEEN THE GHIRLANDA GUERRA 
IDENTITIES AND ULTRAMETRICITY^ 

By Dmitry Panchenko 

Texas A&M University 

We consider a symmetric positive definite weakly exchangeable 
infinite random matrix and show that, under the technical condition 
that its elements take a finite number of values, the Ghirlanda-Guerra 
identities imply ultrametricity. 

1. Introduction and main result. Let us consider an infinite random 
matrix R = {Rij')ij'>i which is symmetric, nonnegative definite [in the 
sense that {Ri,i')i<i,i'<n is nonnegative definite for any n > 1] and weakly 
exchangeable, which means that for any n > 1 and any permutation p 
of {l,...,n}, the matrix {Rp{i),p{v))i<i,v<n has the same distribution as 
{Ri,v)i<i,v<n- Following [6], we will call the matrix with such properties 
a Gram-de Finetti matrix. We assume that diagonal elements Ri^i = 1 and 
nondiagonal elements take only a finite number of values, 

(1.1) P(i?i 2 = gz) = "^m - "iz 

for 1 <l <k and for some — 1 < < (72 < • ' • < % ^ 1 and = mi < ■ ■ ■ < 
rrik < wifc+i = 1- We say that the matrix R satisfies the Ghirlanda-Guerra 
identities [7] if, for any n > 2, any bounded measurable functions / : M"("~i)/2 
R and V : K IR, 

1 1 " 

(1.2) E/„V(iii,n+i) = -E/„E^(i?i,2) + - Ve/,V(^i,z), 

1=2 

where = f{{Ri,v)i<i<v<n)- In other words, conditionally on iRi,i')i<i<i'<n, 
the law of -Ri,n+i is given by the mixture n~^C{Ri^2) + ''T'~^Y1?=2^Ri r 
the positivity principle proven by Talagrand (Theorem 6.6.2 in [14], see also 
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[9]), the Ghirlanda-Guerra identities imply that Ri^2 ^ with probabihty 
1 and, therefore, from now on, we can assume that qi > 0. However, this a 
priori assumption is not necessary since it wih also be clear from the proof 
that qi must be nonnegative. The main result of the paper is the following. 

Theorem 1. Under assumptions (1.1) and (1.2), the matrix R is ultra- 
metric, that is, 



This question of ultrametricity originates in the setting of the Sherrington- 
Kirkpatrick model [13], where R corresponds to the matrix of the overlaps, 
or scalar products, of i.i.d. replicas from a random Gibbs measure. The ul- 
trametricity property (1.3) was famously predicted by Parisi in [8] as a part 
of complete description of the expected behavior of the model and it still 
remains an open mathematical problem. On the other hand, the Ghirlanda- 
Guerra identities, which are implicitly contained in the Parisi theory, were 
proven rigorously in [7] in some approximate sense; namely, one can slightly 
perturb the parameters of the model such that, on average over the pertur- 
bation (or for some specific choice of perturbed parameters, see [15]), the 
Ghirlanda-Guerra identities hold in the thermodynamic limit. In this pa- 
per, we consider an asymptotic distribution of the overlap matrix for which 
the Ghirlanda-Guerra identities hold precisely, as in (1.2), and show that 
they automatically imply ultrametricity, under a technical condition (1.1). 
In some sense, the main idea of the paper is nothing but a reversal of the 
proof of the Ghirlanda-Guerra identities, which arise from the information 
provided by the "stochastic stability" of the system with regard to small per- 
turbations of the parameters. Our main technical contribution, the invari- 
ance principle of Theorem 4, is a very specific form of the stochastic stability 
of the system implied by the Ghirlanda-Guerra identities. The Parisi theory 
for the Sherrington-Kirkpatrick model also predicts that the distribution of 
the overlap Ri^2 bas a nontrivial continuous component, so the assumption 
(1.1) is rather restrictive. Nevertheless, Theorem 1 provides some hope that 
the Ghirlanda-Guerra identities imply ultrametricity in the general case as 
well. 

This work was motivated by a paper of Arguin and Aizenman [3] and, 
in particular, by a beautiful application of the Dovbysh-Sudakov represen- 
tation theorem [6] for Gram-de Finetti matrices that will play the same 
crucial role here. In [3], the authors prove ultrametricity in a slightly differ- 
ent setting as a consequence of what they call the "robust quasi-stationarity 



(1.3) 



P(i?2,3>min(i?i,2,i?i,3)) = l. 



(1.4) 
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property." The main idea in [3] utilizes the robust quasi-stationarity in or- 
der to prove "quasi-stationarity under free evolution" at each step of the 
inductive argument, which, in turn, implies weak exchangeability and, via 
the application of the Dovbysh-Sudakov representation, induces clustering 
of the type (1.4). Our proof is based on exactly the same idea. The differ- 
ence now is that quasi-stationarity under free evolution — the invariance or 
stochastic stability property of Theorem 4 below — will be a consequence of 
the Ghirlanda-Guerra identities and, of course, the induction in the proof 
of Theorem 1 will be different since it is also based on (1.2). In addition, we 
give a new proof in Theorem 3 below that the invariance implies exchange- 
ability, which is based on the explicit control of the mixing induced by the 
random permutation in the invariance principle. 

Simultaneously with the present work, Talagrand developed a different 
approach to Theorem 1 in [15] based on the Ghirlanda-Guerra identities 
and a form of invariance. Theorem 4 below shows that sufficient invariance 
is already contained in the Ghirlanda-Guerra identities and one can now 
find a new more direct proof of Theorem 4 in [15]. In addition, [15] clar- 
ifies the physicists' idea of decomposing the system into pure states and 
explains how both the Ghirlanda-Guerra identities and invariance arise in 
the Sherrington-Kirkpatrick model. 

The rest of the paper is organized as follows. In the next section, we 
start with the Dovbysh-Sudakov representation result which shows that 
any Gram-de Finetti matrix R can be generated by i.i.d. replicas from some 
random Gibbs measure on a separable Hilbert space, which is called the 
directing measure of R, in almost exactly the same way as the overlap matrix 
is generated by the Gibbs measure in the Sherrington-Kirkpatrick model. 
We first study some basic properties of the directing measure which follow 
from the Ghirlanda-Guerra identities; namely, that it always concentrates 
on a nonrandom sphere of the Hilbert space and is either continuous or 
discrete with probability 1. In particular, it is discrete when (1.1) holds. 
In Section 3, we formulate the invariance and exchangeability properties of 
the configuration of the directing measure and use them to prove Theorem 
1 by induction on k. Finally, in Section 4, we show how the Ghirlanda- 
Guerra identities imply the invariance of the directing measure and how the 
invariance implies weak exchangeability. 

2. Basic consequences of GGI and exchangeability. Since all of the prop- 
erties of the matrix R = {Ri^i')i^ii>i considered above — symmetry, positive 
definiteness, weak exchangeability, satisfying the Ghirlanda-Guerra iden- 
tities (GGI) — were expressed in terms of its finite-dimensional distribu- 
tions, we can think of i? as a random element in the product space M = 
Y\i<i ir[—lA] with the pointwise convergence topology and the Borel a- 
algebra 7W. Let V denote the set of all probability measures on A4. Suppose 
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that FeV is such that for all AgM, 




where Q : 17 x 7W — [0, 1] is a probabihty kernel from some probabihty space 
J",Pr) to M such that: (a) Q{u},-) ^'P for all w G (b) Q{-,A) is mea- 
surable on for all A G A^. In this case, we will say that P is a mixture of 
laws Under (2.1), we can write the expectation of any measurable 

P-integrable function : M — )• M as 



We will say that a law Q G "P of a Gram-de Finetti matrix is generated by 
an i.i.d. sample if there exists a probability measure i] on H x [0, oo), where 
-ff is a separable Hilbert space, such that Q is the law of 



where {xi,ai) is an i.i.d. sequence from ij and x ■ y denotes the scalar product 
on H. The analysis of the distribution of R will utilize the following repre- 
sentation result for Gram-de Finetti matrices due to Dovbysh and Sudakov 



Proposition 1. A law ¥ gV of any Gram-de Finetti matrix is a mix- 
ture (2.1) of laws in V such that for all lo gQ, Q(a;,-) is generated by an 
i.i.d. sample. 

We will denote by r]^ a probability measure on H x [0, oo) corresponding 
to Q(a;, •) and let /i^ be the marginal of rj^^ on H. Following the terminology 
of Aldous [2], we will call fii^ the directing measure of the matrix (i?;^//). 
The main result of this section shows that the Ghirlanda-Guerra identities 
imply the following basic geometric properties of the directing measure. 

Theorem 2. Let R be a Gram~de Finetti matrix on M that satisfies the 
Ghirlanda-Guerra identities (1.2) and let F be the law of Ri^2- If Q* the 
largest point of the support of F, then, for Vi-almost all lo: (a) ;Utj(||j;|p = 
q*) = 1; (h) fi^ is continuous if F{{q*}) =0; (c) fi^) is discrete if F{{q*}) > 
0, in which case the sequence of weights of has the Poisson-Dirichlet 
distribution PD{1 - F{{q*})). 

We recall that, given s G (0,1), if {ui)i>i is the decreasing enumeration 
of a Poisson point process on (0,oo) with intensity measure x~^~'^ dx and 
"Wi = '^i/Ylij'^j-' then the distribution of the sequence {wi) is called the 
Poisson-Dirichlet distribution PD[s). If s = 0, then we define PD{0) to be 
the trivial distribution with wi = 1. The proof will be based on the following 
consequence of the Ghirlanda-Guerra identities. 



(2.2) 




(2.3) 



{xi ■ XV + ai5i^it)i^i,>i 



[6]. 
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Lemma 1. Consider a measurable set A C [—1, 1]. For Pr-almost all oj: 

(a) if F{A) > 0, then for fi^-almost all xi, fJ-u}{x2 : xi ■ X2 € A) > 0; 

(b) ifF{A)=0, then for n^-almost all xi, iJ.uj{x2- xi ■ X2 £ A) = 0. 

Proof, (a) Let m = F{A'^) < 1. Then, by the Ghirlanda-Guerra iden- 
tities (1.2), 

P(V2 <l<n + l, Ri^i G A"") = E/(V2 <l<n, Ri^i G G A^) 

n — 1 + 



(by induction on n) 



<l<n,RiieA'') 

n 

(n — 1 + m) ••■(! + m)m 
n! 

m(l + m)/ m\ f m 
' 1 + ^ ••• 1 + 



n V 2/ V 77,-1 



/ • 1 , / + mlogn 777(1 + 771) 
(using I + X < e"^) < ^ igmiogn ^ / ^ 

77 72^ 

Since tti < 1, letting tt- — )• +00 gives P(V2 < l,Ri^i G ^4*^) = 0. By Proposition 
1, P is a mixture (2.1) of measures generated by an i.i.d. sample from r]^^ 
and we can write 

<l,Ri,i£A^)= [ fi^^^^^\y2<l,xi-xi£A^)dFi{uj) = 0, 
Jn 

which implies that for Pr-almost all w, by Fubini's theorem, 

^^u'^'-^H{xl)l>2 : V/ > 2, XI • X2 G A') dfiU^i) = 0. 



This implies that for ^Li^-almost all xi, 

/^®^'-'^((^/)i>2 : V/ > 2,xi • X2 G A') = ^^(x2 :xi • X2 G A^^ = 0, 

which means that fiui{x2 -Xi • X2 £ A'^) < 1 and, therefore, ^u){x2 :xi • X2 G 
A) > 0. To prove part (b), it is enough to express P(-Ri,2 G j4) = using 
Proposition 1. □ 

Proof of Theorem 2. (a) Since F([-l,g*]) = 1 andF([g* -n~^,q*]) >0 
for all ri > 1, Lemma 1 implies that for Pr-almost all w, for /x^-almost all 

^Xuj{x2 : Xi • X2 < q*) = 1, /ia;(2;2 '-Xl' X2>q* - 77"^) > 

(2.4) 

V77> 1. 

Let us fix any such uj. First, the equality in (2.4) implies that /i^^dlxp < q*) = 1. 
Otherwise, there exists h £ H with ||/7|p > q* such that ^^{Bf;{h)) > for 
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any e > 0, where Bi,{h) is the bah of radius e > centered at h. Taking 
e > smah enough so that xi ■ X2> q* for all xi,X2 € Bi;{h) contradicts 
the first equality in (2.4). Next, let us show that /ia;(||a^|P < q*) = 0. Oth- 
erwise, there again exists an open ball Bf,{h) C {x: < g*} such that 
fi'uj{Bs{h)) > 0. For some 5 > 0, ||x|p < q* — 6 for all x G B^{h) and, there- 
fore, for all xi G B^{h) and X2 G < g*}, we have xi-X2 < \J q*[q* — 6) < 
q* — n~^ for large enough n > 1. Since we have already proven that /x^(||x|p < 
q*) = 1, this contradicts the second inequality in (2.4). We have proven that 
^i^{\\xf = q*) = l. 

(b) If F{{q*}) = 0, then, by Lemma 1 for Pr-almost all w, for /i^^-almost 
all xi, we have Huj{x2 :xi ■ X2 = q*) = 0. Therefore, for Pr-almost all w for 
which also ^i^(||3;|p = q*) = 1, /i^^ must be continuous. 

(c) If F{{q*}) = 1, then, by Lemma 1 for Pr-almost all uj, for /i^^-almost 
all xi, we have fiui{x2 :x\-X2 = q*) = 1- By part (a), /Uojdl^^lP = <?*) = 1 and, 
therefore, must be concentrated on one point. If F{{q*}) G (0, 1), then for 
Pr-almost all lo, for /x^^-almost all xi, we have ^iu){x2 : xi • X2 = q*) > and 
since /ia;(||a;|P = q*) = 1, we get that for /i^-almost all xi, /Xa;({xi}) > 0. This 
proves that /i^ is discrete. Let (wi) be the sequence of weights of fi^} arranged 
in decreasing order [we keep the dependence of (wi) on u) implicit]. The fact 
that (wi) has PD{1 — F{{q*})) distribution will follow from the analog of the 
Ghirlanda-Guerra identities for the Poisson-Dirichlet distribution proven by 
Talagrand in Chapter 1 of [14]. Let us explain how this result applies in our 

setting. Let us take any m > 1 and ni, . . . , rim ^ 1 and for n = ni-\ h rim, 

consider a function f on M which is the indicator of the set 

{Rl^l' = q* :ni + ■ ■ ■ + Up + 1 < I ^ I' < ni + ■ ■ ■ + Up + n^+i, <p<m—l}. 

Let us express the expectation E/ using (2.2) and write the inside integral 
in terms of the weights (wi) of the directing measure fi^^. Since, by part 
(a), /itj(||x||^ = q*) = 1 and Ri i' = xi ■ x;/ = q* only if xi = X2 when ||xi||^ = 
11^2 11^ = 9*5 by Fubini's theorem, 

(2.5) / /(i?)Q(a;,dii) = 

i>i i>i i>i 
Similarly, if we take ip{x) = I{x = q*), then 



(2.6) / mRi,n+i)Qioj, dR) = wf^+' E < • • • E 

1>1 1>1 1>1 

and we have fip{Rij) = f for 2 < j <ni and 

(2.7) I /^(i2i,)Q(a;,di?) = E<^"^---E^r^"E<'"---E 



i>i i>i i>i i>i 



GHIRLANDA-GUERRA IDENTITIES AND ULTRAMETRICITY 7 



for ni + 1 <j <n when ni + ■ — \-np + l<j<ni-\ \-np-\- np+i . If we let 

i>i i>i i>i 

and s = 1 - F{{q*}), then plugging (2.5), (2.6) and (2.7) into (1.2) implies 
that 

S{ni + l,n2,...,nm) = — — -S{ni, . . . ,nm) 

n 

m 

Tip, . . . , np-\-\ ) • • • ) f^m ) • 

p=2 " 

This coincides with equation (1.52) in [14]. It is explained there that this 
equation can be used recursively to compute S{ni, . . . ,nm) in terms of s G 
(0,1) only and that the set of numbers S{ni, . . . ,nm) uniquely determines 
the distribution of the weights (w/)/>i as the Poisson-Dirichlet distribution 
PD{s). □ 

3. Ultrametricity in the discrete case. In the case when (1.1) holds, The- 
orem 2 implies that for Pr-almost all uj, the directing measure is discrete 
and concentrated on the sphere of radius ^/qk, that is, 

(3.1) iiuj = ^wi6^(^i) 

i>i 

for some distinct sequence ^(/) G H with ||C(OlP = Qk and > tt;2 > • • • > 0. 
In particular, by (2.3), this implies that a nondiagonal element Ri^i' = xi ■ 
xi' = qk if and only if xi = xi' which proves the ultrametricity at the last level 
k, 

(3.2) Ri,2>qk, Ri,3>qk =^ R2,3>qk- 

This is already a serious achievement, based, in addition to (1.1), only on the 
weak exchangeability of the matrix R via the application of the Dovbysh- 
Sudakov representation. Bringing this idea to light was one of the main 
contributions of [3]. The description of the directing measure /i^^ provided 
by (3.1) corresponds to the pure states picture in physics, where each point 
^(/) can be thought of as a "pure state" of the asymptotic Gibbs measure. 
One of the crucial results in the alternative approach of Talagrand in [15] 
is a very general construction of pure states for measures in Hilbert spaces 
which provides a way around the representation results for exchangeable 
arrays that we are relying on here. 

Since we will hereafter deal with the discrete directing measure (3.1), let 
us introduce some notation that will be more convenient throughout the rest 
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of the paper. We will keep the dependence of the directing measure fi^j on 
u implicit and write E for the integration in u. To describe an i.i.d. sample 
from measure fi in (3.1), consider i.i.d. random variables 

(3.3) o■^o■^...GN 

that take any value I > 1 with probability wi, which is the weight corre- 
sponding to the index / in the directing measure (3.1). Let us denote by (•) 
the expectation in these random indices for a given measure /i, that is, for 
any n > 1 and a function h : N" — )■ M, 

(3.4) {h) = {hia\...,a''))= J] h{h, . . . ,ln)wi, ■ ■ ■ wi„. 

ii,...,i„ 

By the configuration of we will understand the weights and configuration 
of its atoms, 

(3.5) w = {wi)i>i and TZ = m ■ C{l'))i,i'>v 

Since ^{cr^) are i.i.d. from distribution fi, Proposition 1 can be rephrased 
by saying that the nondiagonal elements of the matrix R can be generated 
by first generating a random measure fj, [or its configuration (3.5)], then 
sampling indices (3.3) and setting 

(3.6) Rii,=^{a')-a'T'') = n^i,^i'- 

The Ghirlanda-Guerra identities (1.2) can be rewritten as 

1 1 " 

(3.7) E{fnij{Rl,n+l)) = -mn)m{Rl,2)) + - V E(/„^(/?i,;)) 

1=2 

and (1.1) can be rewritten as 

(3.8) ¥.{I{Ri^2 = qi)) =mi+i-mi for 1 < / < fc. 

The fact that in (3.1), all ||'^(/)|P = Qk implies that in (2.3), all ai = l — qk and, 
therefore, we can safely omit the term aidi^u in (2.3) and redefine the matrix 
R by Ri^i' = ^(cr') • ^(o"'') for all 1,1' >1 so that, from now on, the diagonal 
elements are equal to q^- As mentioned in the Introduction, as in [3], the 
crucial step which will allow us to use the Dovbysh-Sudakov representation 
(2.3) in order to make the induction step in the proof of Theorem 1 is the 
following. 

Theorem 3. The matrixTZ in (3.5) is weakly exchangeable conditionally 
on w = (wi). 
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Of course, this means that TZ is also weakly exchangeable uncondition- 
ally, which is how it will be used in the proof of Theorem 1 , but the proof of 
the stronger statement of conditional exchangeability is exactly the same. 
The proof of Theorem 3 will be based on a certain invariance property 
of the joint distribution of w and TZ — quasi-stationarity under free evolu- 
tion in the terminology of [3] — which, in our setting, will follow from the 
Ghirlanda-Guerra identities. Consider i.i.d. Rademacher random variables 
(e;);>i independent of the measure Given t > 0, consider a new sequence 
of weights 



(3.9) wj 



defined by a random change of density proportional to e*^' . Of course, these 
weights are not necessarily decreasing anymore, so let us denote by (wf) the 
weights (wj) arranged in decreasing order and let vr : N — )• N be the permuta- 
tion keeping track of where each index came from, wf = • Let us define 
by 

(3.10) ^,- = Y^wf6^^^^i)^ and = (e(vr(/)) • e(vr(/')));,F>i 

i>i 

the probability measure after the change of density proportional to e*^' and 
the matrix TZ rearranged according to the reordering of weights. Analogously 
to (3.4), let us denote by {hn)^ the average 

(3.11) {h)^= h{h,...,ln)wl---wl 

h,...,l„ 

and let E now denote the expectation in the randomness of the measure 
/i and the Rademacher sequence (e/). Theorem 3 is a consequence of the 
following invariance principle. 

Theorem 4. For t < 1/2, we have {w'^ ,1^^) = {w,TZ). 

This result can be expressed by saying that the directing measure /U is 
stochastically stable under the change of density (3.9) and is a nontrivial 
consequence of the Ghirlanda-Guerra identities; in fact, as mentioned in 
the Introduction, this is, in some sense, a reversal of the usual derivation of 
the Ghirlanda-Guerra identities. There is a very important technical reason 
why we use Rademacher instead of the more obvious Gaussian change of 
density (as in [3]) that will become clear from the proof. Of course, as will 
be shown in the proof of Theorem 3, Theorem 4 for Rademacher change of 
density (3.9) implies the same result for Gaussian change of density as well. 
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Let us now show how Theorem 3 can be used to prove ultrametricity by 
induction on k. 

Proof of Theorem 1. Given a Gram-de Finetti matrix R, we consider 
the configuration matrix TZ of its directing measure defined in (3.5), which 
is symmetric and nonnegative definite. By (3.1), TZi^i = qk and by (1.1), with 
probabihty 1, nondiagonal elements 

(3.12) ni^i,e{qi,...,qk-i}. 

By Theorem 3, TZ is weakly exchangeable and, thus, is a Gram-de Finetti 
matrix. Therefore, using Proposition 1, there exists a random probability 
measure rj' on H x [0,oo) such that 

(3.13) ini,i>) = {yryu + bi5i,i>), 

where {yi,bi) is an i.i.d. sequence from the distribution rj' . Let us now 
show that (3.12) implies that if fi' is the marginal of r]' on H, then fi' = 
"^i^i w[5^i(i-^ for some distinct sequence £ H, w[ > W2 > ■ ■ • > 0, and, 
moreover, 

(3.14) C'(0-e'(O, U'{l)fe{qi,...,qk-i}. 

Indeed, if a point y belongs to the support of fi' , in the sense that ^\B^{y)) > 
for all e > 0, then in the i.i.d. sequence (yi) from this distribution, there 
will be infinitely many elements from Bf,{y). Since, by (3.12), the scalar 
product yi ■ yi' of these elements belongs to {qi, . . . ,qk-i} with probability 
1, letting e — )• proves that ||y|p G {gi, . . . ,qk-i}- If z is another point in 
the support of /i' such that z G B^{y)^ then ||z|p G {qi, . . . , qk-i] and, there- 
fore, = if £ is small enough. The same argument also proves that 
y • z G {qi, . . . , qk-i}, which implies that y = z if e is small enough. This 
proves that the measure /i' is discrete and (3.14) holds. [Remark: Note that 

(3.14) guarantees that qk^i > and by a forthcoming induction, one can 
similarly conclude that all qi >0, which means that we did not need to 
invoke Talagrand's positivity principle and assume that qi > 0.] 

Let us now explain the induction step. If r~ = min(r, gfc_i), then the 
truncated matrix R~ = (R^i/) is symmetric, weakly exchangeable and, ob- 
viously, automatically satisfies the Ghirlanda-Guerra identities (1.2). Also, 
since Ri^i' = TZ^i and R^^, = 7?.~ , R~ is nonnegative definite whenever 

TZ'^ is, and the fact that TZ~ is nonnegative definite can be seen as follows. 
By (3.13), (3.14) and since TZi^i = q^ by (3.1), we get 

(3.15) (7^,",,) = {yi ■ yv + (%-i - \\yif')5i^v). 

Since, by (3.14), < qu-i-, the right-hand side is obviously nonnegative 

definite. This implies that 1Z~ and, therefore, R~ are nonnegative definite 
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and we have proven that the truncation R~ is again a Gram-de Finetti 
matrix that satisfies the Ghirlanda-Guerra identities. Finahy, the elements 
of R~ take k — 1 values {qi, . . . , qk~i}, 

(3.16) P(fi^2 = Qk^i) = P(i?i,2 > Qk^i) = 1 - ruk^i 
and, for I <k — 2, 

(3.17) P(i?i-2 = qi) = P(^i,2 = qi) = mi+i - mi. 

By the induction assumption, the matrix R~ is ultrametric and together 
with (3.2), this completes the proof of Theorem 1. □ 

It is known [5] that an ultrametric matrix R that satisfies the Ghirlanda- 
Guerra identities must be generated by the directing measure defined via 
the so called Derrida-Ruelle probability cascades [4, 11]. 

4. Invariance and exchangeability. It remains to prove the main con- 
sequences of the Ghirlanda-Guerra identities for the configuration of the 
directing measure /i: a form of the stochastic stability of Theorem 4 and its 
application to the exchangeability of Theorem 3. In the proof below, we will 
need a couple of well-known properties of the Poisson-Dirichlet distribution 
PD(s) which we will now recall. 

By Theorem 2(c), the sequence (wi) in (3.1) has the Poisson-Dirichlet 
distribution PD{s) with s = and it is defined by wi = ui/Ylip>i '^pi where 
(m;);>i is the decreasing enumeration of a Poisson point process on (0,oo) 
with intensity measure x~^~'^ dx. Let us consider an i.i.d. sequence {Xi,Yi) 
on (0,oo) X M, independent of {ui) and such that EXi < oo. Note that Xi 
and Yi need not be independent and, for example, Xi can be a function of 
Yi. Let (y/) be an i.i.d. sequence independent of everything else such that 
for any measurable bounded function (j), 

(4,1) 

which means that the distribution of Yl is the distribution of Yi under the 
change of density /EX|. Let be a random permutation of integers such 
that {uq(j^-^Xq(i-^) is arranged in decreasing order. The first property that will 
be useful (see Proposition 2.3 in [11] or Proposition 6.5.15 in [14]) states 
that 

(4.2) {uei^i-)Xe^i)) = mXif/'ui) 

and, in particular, the sequence of weights w[ = uiXi / Y^^-^^UpXp, after re- 
arranging in decreasing order, {w'^fj^^), again has the Poisson-Dirichlet dis- 
tribution PD{s). A more subtle result. Proposition A. 2 in [4] (this result 
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was rediscovered a couple of times — see Proposition 3.1 in [12] or Lemma 
1.1 in [10]) implies that 

(4.3) (ne(o^e(z),>e(o) = mxlY'^nuYl) 

and, in particular, (w^j^^^, Fg^;)) = {wi,Yl). This property holds in more gen- 
erality, but the case of real- valued (1;) will be sufficient for our purposes. 

Proof of Theorem 4. Given n > 2, let us consider a function /„ = 
f{{Ri,i')i<i<i'<n), where Ri^i' = ^(cr') • ^(cr''), and suppose that ||/||oo < 1- 
Consider a function ip{t) = 'E{fn)iv The central idea of the proof is to show 
that 

(4.4) ^{t)=m fort<^, 

which means that a weakly exchangeable matrix {Ri^i')i^i'>i has the same 
distribution under the directing measures fi'^ and /i. After we prove (4.4), 
we will explain how this implies that the configurations of random measures 
H and have the same distribution, which is precisely the statement of the 
theorem. If, for l>2, we let 

(4.5) A; = e{a') + ■■■ + e{a'-^) -{I- l)e{a') 

[we will write £{l) instead of £/], then it is easy to see that (p'{t) = E(/„A„+i)7r 
and, more generally, 

y,W(t) = E(/„A„+i---A„+fc)^. 

Since [A^j < 2Z, we get \^p'-''\t)\ < 2'' {n + k)] / n\ . If we can show that (^(^^(0) = 
for alU > 1, then 

k\n\ 

and letting k — )■ -|-oo implies that 93 (t) = 93(0) for t < 1/2. This is a good time 
to mention why we used a Rademacher sequence in the change of density 
(3.9) instead of the more obvious Gaussian. In the latter case, using Gaussian 
integration by parts, it is very easy to show that the Ghirlanda-Guerra 
identities imply (/^^'^ (0) = 0; moreover, the Aizenman-Contucci identities [1] 
suffice here. However, the problem of controlling the derivatives [(^('^^(t)! 
becomes extremely difficult. Using bounded Rademacher random variables 
in the change of density gives us control of these derivatives for free, but it 
transfers the difficulty to showing that 99^') (0) = 0, which we now address. 
To complete the proof of (4.4), it remains to show that for all /c > 1, 



(4.6) 



9pW(o)=E(/„A„+i---A„+,.)=0. 



GHIRLANDA-GUERRA IDENTITIES AND ULTRAMETRICITY 13 

Since for t = 0, fi'^ = n and, thus, (•) is independent of the Rademacher 
sequence 

(^('=)(0)=E(/„E,A„+i---A„,+fc), 

where denotes the expectation in Rademacher random variables only. If 
k is odd, then the derivative is zero by changing (si) — )• (—£/)• From now 
on, we will assume that k is even. It is obvious that E^A^+i • • • A„+fc is the 
function of (/(o"' = o"' ))i<i' only and, by (3.1), cr^ = if and only if Ri it = 
Qk, which suggests that the Ghirlanda-Guerra identities can be used in the 
computation of (4.6). Let us start by expanding the product A^+i • • • A„_|_fc. 
Each term in the expansion corresponds to a collection I of n + k disjoint 
sets, /i, . . . , In+k, such that 

{n + 1, . . . , n + A;} = Ii U • • • U In+k 

and such that / describes the fact that we select each £{cr^) for 1 < j < n + 
from factors A/ with indices I ^ Ij- We will call such a collection / a partition 
of {n + 1, . . . , n + A:}, even though some of the sets Ii, . . . , In+k can be empty. 
Therefore, we can write 

(4.7) A„+i • • • ^n+k = cie{a^)\''\ ■ ■ ■ e(a"+'=)l^"+'=l 

for some constants c/ that, of course, depend on the partitions /. Next, for 
any partition P of {1, . . . ,n + k}, let us write 

I l' <;=^ I and I' belong to the same element of P 

and let us denote by Ip = Ip{a^, . . . , a'^'^^) the indicator of the event 

(4.8) Ip = I{a^ = a^' if and only ii I ^p I' ,1 < 1,1' < n + k}. 
Using the fact that 1 = "^plp, let us write, for any partition / in (4.7), 

E^e(cT^)l^il • • • e(a"+'=)l^"+*l = ^/pEee(a^)l^il • • •e(cj"+^')l^"+^-l. 

p 

Each term on the right-hand side is either Ip when 

(4.9) e((Ti)l^il---e(CT"+^')l^"+''-lEEl, 

that is, when, for each set of the partition P, the number of factors £{cr^) 
(with their multiplicities) with indices j inside this set is even, or otherwise, 
since, in this case, at least one independent factor £{cr^) will remain and 

E^e(a^)l^il---e(a"+'^)l^"+'=l =0. 
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Therefore, if we denote by V{I) the coUection of partitions P of the first 
type for which (4.9) holds, then 

(4.10) E(/„E,A„+i---A„+fc) = J^c, Yl Wnlp). 

I PeV(i) 

Since is a function of the overlaps {Ri^v) which takes only a finite num- 
ber of values (1.1), it can be written as a linear combination of indicator 
functions of sets of the type 

(4.11) {Ri^i,=qi^i,:l<l,l' <n] 

for any symmetric nonnegative definite matrix {qi^i')i<i^i'<n with qi^i/ G {qi, . . . , 
qk} and diagonal elements qi^i = qk- Therefore, we can assume that is the 
indicator of the set (4.11). By (3.1) or by (3.2), we can assume that the con- 
straints {qi^i') in (4.11) induce a partition Q on the set {1, . . . ,n} according 
to the rule 

(4.12) / ~Q /' if and only if qi^i, = qk 

and constraints qiji depend on 1,1' only through the partition elements in 
Q which they belong to. If partition Q consists of sets Qi, . . . , Qp and Ij = 
min{/ : / G Qj} is the smallest index in each set, then can be written as 

(4.13) fn = fnlQ, where fl, = I{{Ri,,=qij,:l,l'e{h,...,lp}})- 

In this representation, we separate constraints which describe how coordi- 
nates group together in the partition Q from constraints between represen- 
tatives /i, . . . ,lr of each element of the partition, defined by Note that 
in the definition of all qi^i' ^ qk for / ^ I'. 

Returning to (4.10), if a partition P of {1, . . . , n + A;} does not agree with 
Q on {l,...,n}, then fnip = fnlqlp = 0. This means that in (4.10), we 
can redefine 'P{I) to include only partitions P that agree with Q, that is, 
Iqlp = Ip- For such partitions, we will now compute 

E(/„/p)=E(/;/p) 

slightly more explicitly. Suppose that 

P = PlU---UPpU Pp+i U---UPr 

(we will abuse notation and write a partition as a union of its elements), 
where Pi Ci {1, . . . ,n} = Qi for 1 <l <p and Pi C {n + 1, . . . , n + fc} for p < 
I < r. Of course, it is possible that r = p. Our immediate goal will be to 
demonstrate that the Ghirlanda-Guerra identities imply that 

(4.14) E(/;/p) = $(P)E(/;) 

for some function ^{P) that depends only on the configuration of the parti- 
tion P. The exact formula for ^{P) will not be important, but what will be 
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important is to observe that it does not depend on the constraints in (4.13) 
that define f^. For 1 < j < r, we denote by 

(4.15) lj=mm{l:lePj} 

the smallest index in the set Pj. Obviously, this definition agrees with the 
previous definition of Ij for Qj . Let us consider one of the sets in the partition 
that contains at least two points, for example, Pr- Let I be the largest index 
in Pr and let P' be the restriction of the partition P to the set {1, . . . ,n + k}\ 
{I}. We can then write Jp = /p//(cr''- =cr'). By (3.1), {a^ = a^'} = {Ri^u = qk} 
and we can treat I(a^ = ) as a function of i?/^/' when using the Ghirlanda- 
Guerra identities. Therefore, (3.7) implies that 

The only nonzero terms in the last sum correspond to j £ Pr\{lr,l} and for 
such j, the constraint a''' = is already included in P', so Ip'I{a^'' = a^) = 
Ipi and we get 

Recursively, we can sequentially remove all coordinates with indices in P^., 
except a^^ . If we consider the partition 

P' = PiU---UP,._iU{/,.}, 

then 

(|P,| - 1 - mfc) • • • (|P,| - (|P,| - 1) - mfe) 

We can carry out the same computation on each of the partitions Pi , . . . , Pr-i . 
As a result, if we consider the partition 

(4.16) P' = {h] U • • • U {Ip] U {Ip+i] U • • • U {Ir] 
and denote 

= (IP,- 1 - 1 - mfc) • • • (IP,- 1 - (|P,-| - 1) - ruk) 
for 1 < J < r and = 1 if | P,- 1 = 1 , then 



{n + k-l)---{n + k-{\Pr\-l) (I A| - 1)) 

(4.17) 

xIE(/;/p')- 

Finally, let us simplify E{fl^Ipi). U p = r, then f^Ip' = fn- li p <r, then we 
continue and consider a partition 

(4.18) P" = {h}U---U {Ip} U {Ip+i} U • • • U {Ir-i}. 
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Then Ipi = Ipn — ^^jZllp"I{a'"' = cr'j ) and, using the Ghirlanda-Guerra 
identities, 

r-l 

E(/;/p,) = E(/;/p») - V -E(/;/p»)(l - ruk) = mknfJp")- 

/-^ r — 1 
i=i 

Recursively, we can remove all coordinates to get 

(4.19) E(/;/p,) = mr^E(/;). 

In this last term, we do not need to write the indicator of the partition 
{li} U • • • U {Ip} since these constraints are already contained in the definition 
of Therefore, we have proven (4.14) with 



$(P) = I^V-l^l r-p 

^ ) {n + k-l)---[n + k-{\Pr\-l) (|i^i|-l)) 



and equation (4.10) becomes 

(4.20) E(/„E,A„+i-.-A„+fc) = (5^c, ^ cI>(P)] E(/;). 

^ / P&V(I) ^ 

It seems difficult to show algebraically that Ylii^i^p&v{i) ^(-^) ~ ^- How- 
ever, as mentioned above, one can note that the computation leading to 
(4.20) depends on only through Iq in (4.13) since we only used the 
fact that partitions P G V{I) should agree with Q on {1, . . . ,re}. Therefore, 
(4.20) takes exactly the same form for fn = Iq, for which is the indicator 
corresponding to the partition Qq = {li} U • • • U {Ip}, that is, 

^ I Peril) ^ 

Another way to see this is simply to add up (4.20) for all corresponding 
to the same partition Q. Therefore, since E(/q„) 7^ 0, we will complete the 
proof if we can show that 

E(/QEeA„+i • • • A„+fc) = E(/qA„+i • • • A„+fc) = 0. 

However, this is the /cth derivative of the function (pQ{t) = E{Iq)j^ at t = 
and the result will follow if we can show that ^PQ{t) = ipq{0). The crucial 
observation here is that E,{Iq)t^ depends only on the distribution of the 
sequence (wf) because 



(4.21) cr' = a^' u-^ = w 



provided that all of the weights in (wf) are different with probability 1. By 
(4.2), {wf) has the Poisson-Dirichlet distribution PD{mk) and, therefore. 
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all of the weights wf are different with probability 1, (wf) and (wi) have the 
same distribution and, by (4.21), E(/q)^ = E{Iq). This proves that (pgit) = 
fqiO) and completes the proof of (4.4). 

If R and are Gram-de Finetti matrices generated by the random 
directing measures fj, and fx'^ , respectively, then (4.4) obviously implies that 

v 

R = R^. It remains to prove that 

rEr^ =^ {w,n) = {w'',TV). 

This will follow from the fact that, conditionally on {w,TZ), the matrix R 
is generated by Ri i/ = IZ^i , as in (3.6), from which one can show that 
{u!,TZ) = (j){R) almost surely for some measurable function c^, that is, the 
configuration {w,TZ) of the directing measure can be uniquely reconstructed 
from the overlap matrix R. Note that, by (3.1), with probability 1, the matrix 
R is ultrametric at the level k, in the sense that the relation I I' defined 
by Ri,i' = Qk is an equivalence relation on N, and for any two equivalence 
classes A'"i and N2, the coordinates Ri^i' are equal for all / G A''! and I' G N2- 
Let {R) be the vector of frequencies of the equivalence classes restricted 
to the set {1, . . . ,n}, arranged in decreasing order and then extended to an 
infinite vector by appending all zeros. Let TZ^{R) be the matrix of overlaps 
between the equivalence classes defined by 

(7^"(i^))^,/. = i^,,,• 

for any representatives i and j of the equivalence classes corresponding to 
nonzero w'^{l) and w'^{l') and extended to an infinite matrix by setting q/^ on 
the diagonal and zeros everywhere else. Define 0(i?) as the coordinate-wise 
limit 

(4.22) <^(-R)= lim (w;"(i^),7^"(i^)) 

n— >+oo 

if such limit exists and some fixed value otherwise, including when the matrix 
R is not ultrametric at the level k. Since, conditionally on {w,TZ), the matrix 
R is generated as in (3.6), by the strong law of large numbers, w^{R) con- 
verges almost surely to w coordinate- wise. All coordinates of w are different 
with probability 1 since the sequence {wi) has the Poisson-Dirichlet distri- 
bution. For any such given {w,TZ), TZ'^{R) also converges almost surely 
to TZ coordinate-wise because, asymptotically, each equivalence class corre- 
sponds to a unique ^(i) in the support of fi. This proves that {w,TZ) = (j){R) 
with probability 1 and since, by (4.4), the distribution of R is the same 

under the directing measures /i or we get {w^ ,Tl'^) = {w,Tl). □ 

Finally, it remains to prove that the invariance principle of Theorem 4 
implies exchangeability of the matrix TZ. 
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Proof of Theorem 3. Even though the underlying idea of our proof 
is the same as the idea in Proposition 3.3 in [3], we provide a more direct 
argument based on a very exphcit control of the mixing induced by the ran- 
dom change of density (3.9). Let us start with the observation that Theorem 
4 also holds if we replace a Rademacher sequence in the change of density 
(3.9) by an i.i.d. standard Gaussian sequence (gi). 

Lemma 2. Theorem 4 holds with the change of density 

(4.23) wj 



for arbitrary t > and i.i.d. standard Gaussian (gi). 

Proof. This follows from the fact that the invariance provided by The- 
orem 4 will be preserved if we iterate the process of making the change of 
density (3.9). Namely, if (ef );>! are i.i.d. copies of {ei)i>i for /c > 1 and if 
we define = ej + ■ ■ ■ + Si , then replacing (3.9) with 

the statement of Theorem 4 still holds. We can replace t in (4.24) by tk~^^'^ 
for any fixed t > and large enough k such that tk~^^'^ < 1/2. Each element 
of the i.i.d. sequence {k~^/'^Sf^)i>i converges in distribution to the standard 
Gaussian as — )• +oo. Therefore, we can choose these sequences for all A: > 1 
on the same probability space with some i.i.d. Gaussian sequence {gi)i>i so 
that k~^/'^Si —7- gi almost surely for each /. It is easy to check that the sum 

Wpexptgp a.s., 

p>i p>i 

possibly over some subsequence {k{n)). Then (vj) converges almost surely to 
the sequence (4.23) and, as a result, {w'^,Tl'^) defined in terms of (4.24) also 
converges almost surely to the corresponding configuration defined in terms 
of (gi). Since, by Theorem 4, the distribution of {w'^ ,TZ'^) remains the same 
along this sequence, the distribution of the limiting configuration {■w^,TZ'^) 
defined in terms of (gi) is equal to the distribution of {w,Tl). □ 

Now, given n > 1, let us consider a fixed permutation p of {l,...,n} 
and a measurable subset A C [—1, 1]" . Given m > 1, consider a measurable 
subset B C [0, 1]"*. To prove that conditionally on w the overlap matrix TZ 
is weakly exchangeable, we need to show that 

P(7^^ G A, iwi)i<^ G B), where = (7^p(/),p(//)) 
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does not depend on the permutation p. Without loss of generahty, we can 
assume that m = n, by redefining the sets A and B and permutation p. Also, 
to simplify notation, we will write w £ B instead of {wi)i<m S B. Let vr be a 
permutation of indices induced by the rearrangement of the sequence (4.23), 
that is, w'^ = w^T^^iy If we let 

'^n°'' = i'^TTop{l),nop{l'))l<lJ'<n^ 

then Theorem 4 implies that 

(4.25) P(7^^ eA,weB) = F{ni°P £ A^w"" e B). 

Intuitively, when t goes to infinity, the order of vr(l), . . . ,7r(n) becomes com- 
pletely random because it is determined by the order of logw^^j^i-) +tgTr{i) for 
1 < / < n, which is asymptotically, for t — )■ +00, determined by the order of 
5(7^(1) . Therefore, in the limit, the distribution of TZn°'^, and, thus, 
of TZn, should not depend on p, which means that TZ is weakly exchange- 
able. However, since, a priori, we do not control the dependence of w and TZ, 
turning this intuition into a rigorous argument requires some work. Let us 
denote by j = (j(l), • • • a generic vector with all indices different 

and let vr o p = (vr o p{l), . . . ,7r o p{n)). With this notation, the right-hand 
side of (4.25) can be written as 

Y.n^^'^Aw'^ ^B,'Kop=j) 

(4.26) ^ 

= ^F{ni€A,w^ GB,TTop = j), 
j 

where we have also introduced the notation TZn = i'^j{i),j(i'))i<i,i'<n- Con- 
ditionally on w = (wi), the events {TZn G A} and {w'^ G -B,7r o p = j} are 
independent since the latter depends only on the sequence (gi) and, there- 
fore, 

F{TZieA,w'' G B,TTop = j\w) 

(4.27) 

= F(TZi £ A\w)F{w'' £ B,7rop = j\w). 

If T is another fixed permutation of {1, . . . ,n}, then (4.25), (4.26) and (4.27) 
imply that 

\V{TZP eA,weB)- F{TZl eA,weB)\ 
(4.28) -Yj I l^^^" € B,-Ko p = j\w) 

-Fiw"" G B,TTOT = j\w)\dk{w), 
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where A is the distribution of w. Let us express one of the events Cp = 
{w'^ G-B,7rop = j} in terms of the sequence {gi). If we let 

(4.29) k = jop~^, that is, k{l) = j{p-^{l)) for 1 < ^ < n, 

then, by the definition of vr, the event {vr o p = j} expresses the fact that 
for 1 < / < n, the number w^i-jexptg^i-j occupies the position / among all 
the elements of (wiexptgi) arranged in decreasing order. If we introduce the 
notation 



x = suY>{gi + t~'^\ogWi), y = ^ 



then the event Cp can be written as 



I W + Ljl<i<n^ / l</<n 

Let us first consider the probability of Cp conditionally on w and {gi)i^j, 
that is, for fixed x, y and (7fc(/)). Since (zi) are independent and zi has normal 
distribution iV(7fc(;), 1), we can write 

^{Cp\w, {gi)i^j) = J- / exp [ ~ 7fe(/))^ ] dzi ■ ■ ■ dzn 




/ exp(^7fc(o^;-^5^^?) 
■"^p \i=i 1=1 / 



dzi • ■ • dZn ■ 



Since the event (4.30) does not explicitly depend on p, the last integral 
depends on p only through the term X^^Ll7fc(^)^^ denote by (7^") 

and (7^"^) the sequence (7fc{i)) arranged in decreasing and increasing order, 
respectively, then, on the event Cp (since zi> ■ • • > Zn), 

n n n 

1=1 1 = 1 1 = 1 

Therefore, the probability ¥[Cp\w, {gi)i^j) is maximized on the permutation 
p for which the sequence k{l) = j{p~^{l)) in (4.29) is increasing, that is, p 
and j are similarly ordered. This is, obviously, equivalent to vroe = j"*", where 
e is the identity permutation, e{l) = I, and j'"*" is the increasing rearrangement 
of j. Similarly, ¥{Cp\w, {gi)i^j) is minimized on the permutation p for which 
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the sequence in (4.29) is decreasing, which is equivalent to vr o e' = for 
the inverse permutation, e'(/) = n — I + 1. Averaging over {gi)i^j, we have 
proven that 

Fiw"" G B,TToe' =j^\w) <¥{w'' G B,TTop = j\w) 

<F{w'' eB,TToe = j^\w) 

and, therefore, 

\¥{w'' G B,TTOp = j\w) - F{w'^ G B,TTOT = j\w)\ 

< Fiw"" eB,TToe = j+ lu)) - Fiw"" G vr o e' = j^\w) 
for any p,T and j. Plugging this into (4.28) gives 

|P(7^^ €A,weB)- P(7^; eA,weB)\ 

(4.31) 

< ¥{w'' G 3j : vr o e = j+) - P(i(;'' e B,3j :tt o e' = i+). 

We divide by n! because each j"^ corresponds to n\ different j. It remains to 
show that the right-hand side goes to zero when t in (4.23) goes to infinity. 
Let us recall the definition wf = ■w'^(i) and let us similarly define gf = g^^^i^ . 

The event {3j : vr o e = j"^} can then be expressed in terms of {w^,g^), as 
follows. On one hand, this event simply means that vr(l) < • • • < vr(n). On 
the other hand, (4.23) implies that, if we let k = X]p>i ^p^*^''' 

and, therefore, 

(4.32) {3j:^oe = i+} = {<e-*^'i >---><e-*f"}. 

By (4.2), (wf) has the Poisson-Dirichlet distribution A = PD[mi^) and it is 
easy to check that (4.3) and (4.1) imply that {gj) is an i.i.d. sequence with 
normal distribution v = N(tmk,l), independent of (wf). Therefore, 

P(w;''GS,3j:vroe=j+) 

= / zv®«((5r)i<K„:«;[e-*^'^ >---><e-*fn)dA(«;-). 
Jb 

For any fixed w'^ , v®'^{w1e~*3'^ > ••• > u;^e~*^") — 1/n! when t — )• +oo 
since, asymptotically, this event is equivalent to 5i < • • • < ff^ and, there- 
fore, ¥{w'^ £ B,3j -.TT o e = j"*") — )■ A{B) as t — )• +oo. Similarly, the fact that 
F{'w'^ G B, 3j : vroe' = j"*") — >• A{B), together with (4.31), completes the proof. 
□ 
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